On the Correction of "Old" Omitted Citations by Bibliometric Databases

نویسندگان

  • Fiorenzo Franceschini
  • Domenico A. Maisano
  • Luca Mastrogiacomo
چکیده

Omitted citations – i.e., missing links between a cited paper and the corresponding citing papers – are the main consequence of several bibliometric-database errors. To reduce these errors, databases may undertake two actions: (i) improving the control of the (new) papers to be indexed, i.e., limiting the introduction of “new” dirty data, and (ii) detecting and correcting errors in the papers already indexed by the database, i.e., cleaning “old” dirty data. The latter action is probably more complicated, as it requires the application of suitable errordetection procedures to a huge amount of data. Based on an extensive sample of scientific papers in the Engineering-Manufacturing field, this study focuses on old dirty data in the Scopus and WoS databases. To this purpose, a recent automated algorithm for estimating the omitted-citation rate of databases is applied to the same sample of papers, but in three different-time sessions. A database’s ability to clean the old dirty data is evaluated considering the variations in the omitted-citation rate from session to session. The major outcomes of this study are that: (i) both databases slowly correct old omitted citations, and (ii) a small portion of initially corrected citations can surprisingly come off from databases over time. Conference Topic Data Accuracy and disambiguation Introduction An important branch of the bibliometric literature examines errors in bibliometric databases. Several studies show that the major consequence of database errors is represented by omitted citations, i.e., citations that should be ascribed to a certain (cited) paper but, for some reason, are lost (Moed, 2005; Buchanan, 2006; Jacsó, 2006, Li et al., 2010; Olensky, 2013). Franceschini et al. (2013) proposed an automated algorithm for estimating the omittedcitation rate of bibliometric databases. This algorithm requires the combined use of two or more bibliometric databases and is based upon the hypothesis that the mismatch between the citations occurring in one database and another one is evidence of possible errors/omissions. In a further study by Franceschini et al. (2014), this algorithm was applied to a relatively large set of publications, showing that, depending on the bibliometric database in use (Scopus or WoS), omitted citations are not distributed uniformly among publishers; e.g., regarding the publications in the Engineering-Manufacturing field, citations from papers published by Wiley-Blackwell are more likely to be omitted by Scopus, while those from papers published by ASME (American Society of Mechanical Engineers) are more likely to be omitted by WoS. A reason behind this result is that some editorial styles imposed by certain publishers can probably hamper the correct identification of the cited papers by some databases. The presence of database errors, as well as journal coverage or author disambiguation, is probably one of the major concerns of database administrators. In the authors’ opinion, database administrators may undertake two actions for reducing database errors: 1. Limiting the introduction of “new” dirty data in a database, i.e., errors concerning new papers to be indexed; 2. Cleaning “old” dirty data, i.e., errors concerning papers/journals already indexed by a database.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A novel approach for estimating the omitted-citation rate of bibliometric databases with an application to the field of bibliometrics

One of the most significant inaccuracies of bibliometric databases is that of omitted citations, namely, missing electronic links between a paper of interest and some citing papers, which are (or should be) covered by the database. This paper proposes a novel approach for estimating a database’s omitted-citation rate, based on the combined use of 2 or more bibliometric databases. A statistical ...

متن کامل

Scientific journal publishers and omitted citations in bibliometric databases: Any relationship?

. Introduction and literature review Bibliometric databases, like any database, are not free from errors. Despite the improved accuracy over the past ten ears – probably due to the systematic employ of automatic tools for correcting errors in cited article lists by editors and atabase administrators (Adam, 2002) – the problem is far from being solved. This is proven by (i) several recent articl...

متن کامل

Visualization of the Koomesh journal between 2006 and 2017: A bibliometric study

Introduction: The present study was conducted with the aim of analyzing the bibliometrics of Koomesh, as one of the oldest and most reputable Iranian medical journals. Materials and Methods: This study was conducted using a bibliometric method on the articles published in Koomesh during the years 2006-2017. For this purpose, through advanced search in the Scopus database, 764 papers were extrac...

متن کامل

عملکرد مجلات علوم پزشکی ایران نمایه شده در پایگاه گزارش استنادی نشریات

Introduction: With an increasing number of Iranian scientific journals indexed by international databases, assessment of their performance and status is also needed. The present study aims to investigate the international performance of Iranian medical journals. Methods: To conduct the current study, bibliometric research method has been applied. International statuses of 21 Iranian ...

متن کامل

View From Above: Bibliometric and Citation Analysis of Global High Altitude Medicine Research

Introduction: High altitude destinations are popular among international travelers. Travel medicine practitioners should be familiar with altitude physiology and high altitude illness recognition, prophylaxis, and management. We performed the first bibliometric analysis of high altitude medicine research. Methods: All articles published in a specialist hi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015